Geometric Methods for Optical Character Recognition a Dissertation Presented by Abstract of the Dissertation Geometric Methods for Optical Character Recognition
نویسندگان
چکیده
of the Dissertation Geometric Methods for Optical Character Recognition by George N. Sazaklis Doctor of Philosophy in Computer Science State University of New York at Stony Brook Advisor: Joseph S. B. Mitchell 1997 Abstract Optical Character Recognition (OCR) is an important problem having both theoretical and practical interest. In this dissertation, we present solutions to three problems within the area of OCR. A di culty encountered by many OCR systems is confusions between similar shapes, when exible matching is employed as a primary recognition mechanism. Our solution, constrained matching as a second stage classi cation technique, can discriminate between similar shapes, using shape geometric attributes; thus the system is enabled to reach a nal decision on the character identity. Another important problem in OCR is the fast and reliable xed-font recognition. We present a hierarchical classi cation technique that utilizes the concept of geometric probe trees from [4]. At each node of the probe tree, a geometric probe collects information from the shape at hand, and makes a partial decision about its identity, eliminating certain candidates from further consideration. The probe tree can be constructed o -line in a preprocessing step and can provide us with high speed recognition for a xed font. ii We also present an extension of geometric probes, the pre x probing technique that solves the important practical problem of touching characters for a xed font. Pre x probing is based on a forest of probe trees that succeeds in identi ing each member from a sequence of touching characters independently of its neighbors. This avoids the excessive demands of a straightforward solution. To substantiate our methods, we have developed several tools, and we have done experiments with scanned documents, so we also present our experimental results. iii
منابع مشابه
Recognition of Characters in Scenes by Using Geometric Contexts of Local Features
If you can get useful information from a character image in scene you take with your camera, all you have to do is release the shutter and you can save time. In order to realize such a system, we have to propose a character recognition system such that it can correctly find the character area from the scene image and recognize it. In this paper, we propose two methods to achieve it, which consi...
متن کاملImage Preprocessing For Geometric Feature Extraction in OCR Systems
Optical character recognition (OCR) is one of the most successful application of pattern recognition and image processing. Character geometry is one of the most useful feature for identifying characters in images. The geometric feature extraction techniques proposed in literature are complex and requires extensive effort in implementation. In this paper, we propose a preprocessing technique whi...
متن کاملAutomated Labeling from Biomedical Journals published in Foreign Languages
An automated labeling (AL) module is developed to produce bibliographic records such as English title, vernacular title, author, affiliation, and English abstract from biomedical articles published in foreign language journals. Optical character recognition (OCR) output from scanned biomedical journals is used in this labeling process. Since frequently occurring words in a zone are important fe...
متن کاملTranscript mapping for handwritten Chinese documents by integrating character recognition model and geometric context
Creating document image datasets with ground-truths of regions, text lines and characters is a prerequisite for document analysis research. However, ground-truthing large datasets is not only laborious and time consuming but also prone to errors due to the difficulty of character segmentation and the large variability of character shape, size and position. This paper describes an effective reco...
متن کاملGeometric Probing and Testing - A Survey
Geometric probing is the area of computational geometry that studies how to identify, verify, or determine some property of an unknown geometric object using a measuring device known as a probe. It has applications in the areas of robotics, automated manufacturing, computer vision, optical character recognition and tomography. Geometric testing is the subarea of geometric probing that studies t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997